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REMARKS 

Claims 20, 29, 34, 35 and 38 are amended. Claims 1-38 remain in the 
application for consideration. In view of the following remarks, Applicant 
respectfully requests withdrawal of the rejections and forwarding of the 
application onto issuance. 

Clarifying Amendment 

The language of claim 20 has been clarified to change the word "a" to "an" 
in accordance with the Office's objection. Applicant respectfully thanks the 
Office for the Office's attention to detail. 

§102 and §103 Rejections 

Claims 21-22 stand rejected under 35 U.S.C. § 102(e) as being anticipated 
by U.S. Patent No. 6,496,802 to Van Zoest et al. (hereinafter "Van Zoest"). 

Claims 35-38 stand rejected under 35 U.S.C. §102(e) as being anticipated 
by U.S. Patent No. 6,269,122 to Prasad et al. (hereinafter "Prasad"). 

Claims 1-6, 9-11, 13, 16, 17, 23-26 and 31 stand rejected under 35 U.S.C. 
§103(a) as being obvious over U.S. Patent No. 5,655,144 to Milne et al. 
(hereinafter "Milne"), in view of U.S. Patent No. 6,442,758 to Jang et al. 
(hereinafter "Jang"). 

Claim 34 stands rejected under 35 U.S.C. § 103(a) as being obvious over 
Van Zoest in view of Jang. 

Claims 7-8, 12, 14-15, 27-28, and 32-33 stand rejected under 35 U.S.C. 
§ 103(a) as being obvious over Milne in view of Jang and Van Zoest. 
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Claims 18-20 stand rejected under 35 U.S.C. § 103(a) as being obvious over 
Milne and Jang in further view of U.S. Patent No. 6,314,569 to Chernock et al. 
(hereinafter "Chernock"). 

Claims 29-30 stand rejected under 35 U.S.C. §103(a) as being obvious over 
Van Zoest and Jang, in further view of Chernock. 

Before discussing the substance of the Office's rejections, a short 
discussion of the §103 standard is provided. Following this, a short discussion of 
Applicant's subject matter is provided, along with a discussion of selected 
references utilized by the Office in making out various rejections in the present 
Office Action. 

The §103 Standard 

In making out a §103 rejection, the Federal Circuit has stated that when one 
or more reference or source of prior art is required in establishing obviousness, "it 
is necessary to ascertain whether the prior art teachings would appear to be 
sufficient to one of ordinary skill in the art to suggest making the claimed 
substitutions or other modification." In re Fine, 5 USPQ 2d, 1596, 1598 (Fed. Cir. 
1988). That is, to make out a prima facie case of obviousness, the references must 
be examined to ascertain whether the combined teachings render the claimed 
subject matter obvious. In re Wood, 202 USPQ 171, 174 (C.C.P.A. 1979). 

Moreover, there is a requirement that there must be some reason, 
suggestion, or motivation from the prior art, as a whole, for the person of ordinary 
skill to have combined or modified the references. See, In re Geiger, 2 USPQ 2d 
1276, 1278 (Fed. Cir. 1987). Additionally, particular findings must be made as to 
the reason the skilled artisan, with no knowledge of the claimed invention, would 
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have selected these components for combination in the manner claimed. See, e.g., 
In ReKotzab, 217 F.3d 1365, 1371, 55 USPQ2d 1313, 1317 (Fed. Cir. 2000). 

It is impermissible to use the claimed invention as an instruction manual or 
"template" to piece together the teachings of the prior art so that the claimed 
invention is rendered obvious. One cannot use hindsight reconstruction to pick 
and choose among isolated disclosures in the prior art to deprecate the claimed 
invention. In re Fritch, 23 USPQ 2d 1780, 1784 (Fed. Cir. 1992). 

A factor cutting against a finding of motivation to combine or modify the 
prior art is when the prior art teaches away from the claimed combination. A 
reference is said to teach away when a person or ordinary skill, upon reading the 
reference, would be led in a direction divergent from the path that the applicant 
took. In re Gurley, 31 USPQ 2d 1130, 1131 (Fed. Cir 1994). 

In order for a prima facie case of obviousness to be made, the resulting 
combination or motivation must appear to show or suggest the claimed invention. 
In reNielson, 2 USPQ 2dl525, 1528 (Fed. Cir. 1987). 

In view of the standard for establishing a prima facie case of obviousness, 
Applicant respectfully disagrees with the Office's rejections of the present claims 
and submits that the Office has not established a prima facie case of obviousness. 

Applicant's Disclosure 

Perhaps a good place to start to appreciate the various claimed 
embodiments in the present application is the "Background" section of the 
application. There, various problems associated with visualizations are described. 
Specifically, starting on page 1, line 12, the application states as follows: 
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One problem associated with prior art media players is they all tend 
to display different types of media in different ways. For example, some 
media players are configured to provide a "visualization" when they play 
audio files. A visualization is typically a piece of software that "reacts" to 
the audio that is being played by providing a generally changing, often 
artistic visual display for the user to enjoy. 

*** 

[T]here are problems associated with prior art visualizations. As an 
example, consider the following. One of the things that makes 
visualizations enjoyable and interesting for users is the extent to which they 
"mirror" or follow the audio being played on the media player. Past 
visualization technology has led to visualizations that do not mirror or 
follow the audio as closely as one would like. This leads to things such as a 
lag in what the user sees after they have heard a particular piece of audio. It 
would be desirable to improve upon this media player feature. 



Material in the specification that describes the visualization embodiments 
begins on page 15 at line 18. The references cited by the Office, by and large, do 
not deal with visualizations as that term is contemplated and used in the present 
application. 



The Milne Reference 

Milne describes methods and systems for providing synchronization of the 
timing of various multimedia events, including an audio event. In accordance with 
Milne's disclosure, clock objects are defined and are associated with an internal or 
external source of current time. The clock objects are able to be displayed on a 
display, but can be hidden once their linkages are defined. One or more 
multimedia objects representative of audio, visual or other multimedia events, 
including an audio object, are defined and linked to a particular clock object or 
clock objects. Then, a processor synchronizes the multimedia objects with the 
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associated clock object or objects. Milne instructs that the various multimedia 
events are then performed in synchronization with their associated clocks. 

Milne goes to great lengths to explain how its various clocks can be 
synchronized to, in turn, synchronize associated audio and video. For example, in 
column 9, starting at line 44, Milne describes how audio and video sequences can 
be synchronized, as shown in Fig. 11. In order to synchronize the audio and video 
sequences, Milne instructs that the clocks of two players would be synchronized as 
shown in Fig. 12. There, the audio player is viewed as the master clock and the 
video player is slaved to the audio player so that it always follows the audio 
player. 

Thus, what Milne is concerned with is probably best thought of as 
synchronizing presentation times through techniques that establish relationships 
between clocks associated with different types of media. When viewed in the 
context of the claimed subject matter, it becomes apparent that the various claimed 
embodiments are really concerned with something that is quite different from 
Milne. 

The Claimed Subject Matter Rejected Over Milne 
Claim 1 recites a system for synchronizing a visualization with audio 
samples comprising: 

• one or more audio sources configured to provide audio samples that 
are to be rendered by a media player; 

• an audio sample pre-processor communicatively linked with the one 
or more audio sources and configured to receive and pre-process 
audio samples before the samples are rendered, the pre-processing 
providing characterizing data associated with each sample; and 
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• one or more effects configured to receive the characterizing data and 
use the characterizing data to render a visualization that is 
synchronized with an audio sample that is being rendered by the 
media player. 

In making out the rejection of this claim, the Office argues that Milne 
teaches: 

(1) one or more audio sources configured to provide audio samples, 
and cites to column 16 5 lines 13-19, column 15, lines 48-60; 

(2) one or more effects configured to receive the characterizing 
data and use the characterizing data to render a visualization 
that is synchronized with an audio sample that is being rendered 
by the media player, and cites to column 17, lines 15-63, and 
Figs. 12 and 36; and 

(3) an audio sample pre-processor communicatively linked with the 
one or more audio sources and configured to receive and pre- 
process audio samples before the samples are rendered to 
provide characterizing data associated with each sample, and 
cites to column 19, lines 1-11. 

The Office then states that Milne does not expressly teach a pre-processor 
and relies on Jang, citing to element 117 described in column 11, lines 1-2 and 
Fig. 6. 

Based on these two references, the Office then argues that it would be 
obvious to combine the teachings of these references to render the claimed subject 
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matter obvious. As a motivation to combine the references, the Office argues such 
would be obvious "in order to split the audio signal into stereo signals." See, 
Office Action, page 6, second paragraph. 

Applicant respectfully disagrees with the Office's analysis and application 
of the cited references and respectfully traverses the Office's rejections. 

To begin with, Milne neither discloses nor suggests a system that renders a 
visualization that is synchronized with an audio sample, as those terms are 
contemplated and used in the context of the present application. 

Consider first what a visualization is and how it is synchronized with an 
audio sample in accordance with the subject matter of claim 1. It is to be 
appreciated and understood that the description that is given below is not intended 
to be used to limit application of the subject matter of claim 1 to anything other 
than the specific language and terms that appear in claim 1 . As such, aspects of 
the discussion below simply illustrate features that can be utilized to implement a 
system that can comprise components recited in claim 1 . 

Specifically, a visualization, as noted in the "Background" section, is 
typically a piece of software that "reacts" to the audio that is being played by 
providing a generally changing, often artistic visual display for the user to enjoy. 
See, e.g. Specification, page 1, lines 12-17. A good illustration of an exemplary 
visualization in the form of "electricity" appears in Fig. 6 in the rendering area of 
the middle media player. 

Now consider the description of visualizations that appears in the 
Specification starting on page 15, line 18, an excerpt of which appears directly 
below: 
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Fig. 8 shows one embodiment of a system configured to accurately 
synchronize a visual representation with an audio waveform generally at 
800. System 800 comprises one or more audio sources 802 that provide the 
audio waveform. The audio sources provide the audio waveform in the 
form of samples. 

**# 

An audio sample preprocessor 804 is provided and performs some 
different functions. An exemplary audio sample preprocessor is shown in 
more detail in Fig. 9. 

The Specification then goes on to describe one specific implementation 
example, aspects of which are recited in claim 1. Specifically, as noted in the 
Specification starting on page 16 at line 1 1 : 

Referring both to Figs. 8 and 9, as the audio samples stream into the 
preprocessor 804, it builds and maintains a collection of data structures 
indicated generally at 806. Each audio sample that is to be played by the 
media player has an associated data structure that contains data that 
characterizes the audio sample. These data structures are indicated at 
806a, 806b, and 806c. The characterizing data is later used to render a 
visualization that is synchronized with the audio sample when the audio 
sample is rendered. (Emphasis added). 

**# 

Preprocessor 804 also preprocesses each audio sample to provide 
characterizing data that is to be subsequently used to create a 
visualization that is associated with each audio sample. In one 
embodiment, the preprocessor 804 comprises a spectrum analyzer module 
902 (Fig. 9) that uses a Fast Fourier Transform (FFT) to convert the audio 
samples from the time domain to the frequency domain. The FFT breaks 
the audio samples down into a set of 1024 frequency values or, as termed in 
this document, "frequency data." The frequency data for each audio sample 
is then maintained in the audio sample's data structure. In addition to 
maintaining the frequency data, the preprocessor 804 can include a 
waveform analysis module 904 that analyzes the audio sample to provide 
waveform data. The preprocessor 804 can also includes a stream state 
module 906 that provides data associated with the state of the audio stream 
(i.e. paused, stopped, playing, and the like). (Emphasis added). 
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Referring specifically to Fig. 8, a buffer 808 can be provided to 
buffer the audio samples in a manner that will be known and appreciated by 
those of skill in the art. A renderer 810 is provided and represents the 
component or components that are responsible for actually rendering the 
audio samples. The renderer can include software as well as hardware, i.e. 
an audio card. 

Fig. 8 also shows audio rendering object or VisHost 608. Associated 
with the audio rendering object are various so-called effects. In the 
illustrated example, the effects include a dot plane effect, a bar effect, and a 
ambience effect. The effects are essentially software code that plugs into 
the audio rendering object 608. Typically, such effects can be provided by 
third parties that can program various creative visualizations. The effects 
are responsible for creating a visualization in the unified rendering area 
406. (Emphasis added). 

In the illustrated and described embodiment, the audio rendering 
object operates in the following way to ensure that any visualizations that 
are rendered in unified rendering area 406 are synchronized to the audio 
sample that is currently being rendered by renderer 810. The audio 
rendering object has an associated target frame rate that essentially defines 
how frequently the unified rendering area is drawn, redrawn or painted. As 
an example, a target frame rate might be 30 frames per second. 
Accordingly, 30 times per second, the audio rendering object issues what is 
known as an invalidation call to whatever object is hosting it. The 
invalidation call essentially notifies the host that it is to call the audio 
rendering object with a Draw or Paint command instructing the rendering 
object 608 to render whatever visualization is to be rendered in the unified 
rendering area 406. When the audio rendering object 608 receives the 
Draw or Paint command, it then takes steps to ascertain the preprocessed 
data that is associated with the currently playing audio sample. Once the 
audio rendering object has ascertained this preprocessed data, it can issue a 
call to the appropriate effect, say for example, the dot plane effect, and 
provide this preprocessed data to the dot plane effect in the form of a 
parameter that can then be used to render the visualization. 



The notion of "characterizing data" is further discussed on page 19, starting 
at line 23. Specifically, the audio samples are pre-processed to provide 
characterizing data. One desirable feature of characterizing data is that it 
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provides some measure from which a visualization can be rendered. Thus, the 
characterizing data associated with the audio samples is used to render a 
visualization that is synchronized with the audio samples. 

The Office argues that Milne discloses an audio sample pre-processor that 
pre-processes audio samples to provide characterizing data associated with each 
sample. The excerpt relied on by the Office (i.e. column 19, lines 1-11) is 
reproduced below in its entirety: 

FIG. 36 is an illustration of various clock objects linked together and 
multimedia objects in accordance with a preferred embodiment. The 
linkages are created using a cursor to rubber band a geometric figure, such 
as a line segment, to join up a clock object 3610 to another clock object 
3620, or multimedia objects 3630 and 3640. FIG. 37 is an illustration of a 
visual object synchronized with an audio object in accordance with a 
preferred embodiment. The visual clock object 3710 is synchronized with 
the audio clock object 3720 to control the associated multimedia 
presentation of music and displays represented by the animation 
multimedia object 3730. 

Applicant has studied Milne in its entirety and can find no disclosure or 
suggestion of a system comprising, inter alia, an audio pre-processor configured 
to pre-process audio samples to provide characterizing data associated with each 
sample, and one or more effects configured to receive the characterizing data and 
use the characterizing data to render a visualization that is synchronized with an 
audio sample that is being rendered. 

Additionally, the Office's reliance on Jang is misplaced for a couple of 
different reasons. First, it appears that any similarities between Jang's "audio pre- 
processor" and the presently recited audio sample pre-processor are in name only. 
That is, Jang's pre-processor appears to be provided and used for an entirely 
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different reason — i.e. in connection with its MPEG Packet Processor 108, and not 
to provide characterizing data associated with audio samples. Second, the 
Office's reliance on Jang and its stated motivation to combine Milne and Jang 
appear to be based on hindsight reconstruction that has utilized Applicant's 
disclosure as a temple to hunt through the prior art - an act that has been 
specifically proscribed by the Federal Circuit. Further, from a technical 
standpoint, it is unclear at best why one would incorporate Jang's pre-processor 
with Milne's system. 

Applicant respectfully submits that the Office has failed to establish a 
prima facie case of obviousness. Accordingly, for at least this reason, claim 1 is 
allowable. 

Claims 2-8 depend from claim 1 and are allowable as depending from an 
allowable base claim. These claims are also allowable for their own recited 
features which, in combination with those recited in claim 1, are neither disclosed 
nor suggested in the references of record, either singly or in combination with one 
another. Given the allowability of these claims, the rejection of claims 7 and 8 
over the combination with Van Zoest is not seen to add anything of significance. 

Claim 9 recites a media player comprising: 

• an audio sample pre-processor configured to receive and pre-process 
audio samples before the samples are rendered by the media player, 
the pre-processing providing frequency data associated with each 
sample; and 

• one or more effects configured to receive the frequency data and use 
the frequency data to render a visualization that is synchronized with 
an audio sample that is being rendered by the media player. 
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• 



In making out the rejection of this claim, the Office argues that Milne 
discloses "one or more effects" as recited and cites to column 17, lines 15-63. 
Further, the Office argues that Milne discloses an audio sample pre-processor as 
recited and cites to column 16, lines 13-39, column 15, lines 48-60, and column 
19, lines 1-11. The Office then apparently relies on Jang in much the same way as 
it relied on Jang to make out the rejection of claim 1. Applicant respectfully 
disagrees that traverses the Office's rejection. 

Specifically, Applicant has studied Milne and Jang in great detail and can 
find no disclosure or suggestion of a media player that comprises an audio sample 
pre-processor configured to pre-process audio samples to provide frequency data, 
and one or more effects configured to receive the frequency data and use the 
frequency data to render a visualization that is synchronized with an audio sample 
that is being rendered. The Office has failed to establish a prima facie case of 
obviousness. Accordingly, this claim is allowable. 

Claims 10-12 depend from claim 9 and are allowable as depending from an 
allowable base claim. These claims are also allowable for their own recited 
features which, in combination with those recited in claim 9, are neither disclosed 
nor suggested in the references of record, either singly or in combination with one 
another. Given the allowability of these claims, the rejection of claim 12 over the 
combination with Van Zoest is not seen to add anything of significance. 

Claim 13 recites a system for synchronizing a visualization with audio 
samples comprising: 

• an audio sample pre-processor configured to receive and preprocess 
audio samples before the samples are rendered by a renderer that 
comprises part of a media player, the audio sample preprocessor 
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preprocessing the samples to provide characterizing data associated 
with each sample, the characterizing data comprising a timestamp 
associated with each audio sample, the timestamp being assigned in 
accordance with when the audio sample is calculated to be rendered 
by the renderer; 

• multiple data structures configured to hold the characterizing data, 
each data structure being associated with an audio sample; 

• an audio rendering object configured to call the audio sample pre- 
processor to ascertain the characterizing data associated with an 
audio sample that is currently being rendered by the renderer; 

• the audio sample pre-processor being configured to ascertain said 
characterizing data by querying the renderer for a time associated 
with the currently-rendered audio sample, and then using said time 
to identify a data structure having a timestamp that is nearest in 
value to said time; and 

• one or more effects configured to receive characterizing data that is 
associated with the data structure having the timestamp that is 
nearest in value to said time, and use the characterizing data to 
render a visualization that is synchronized with the audio sample that 
is being rendered by the renderer. 



In making out the rejection of this claim, the Office argues that Milne 



discloses: 



(i) 



an audio rendering object as recited and cites to column 



6, lines 21-33 and column 8, lines 32-39; 



(2) 



one or more effects as recited and cites to column 17, 



lines 15-63; 



(3) 



an audio sample pre-processor as recited and cites to 



column 9, lines 43-61, column 8, lines 40-65, column 16, 



lines 13-39, column 15, lines 48-60, and column 16, 



lines 40 to column 17, line 12. 
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The Office then argues that Milne does not specifically teach a pre- 
processor and then relies on Jang in much the same way as it relied on Jang to 
make out the rejections of the claims discussed above. Applicant respectfully 
disagrees with the Office's assessment and combination of the references. 
Applicant respectfully submits that the Office has not established a prima facie 
case of obviousness. Specifically, nowhere does Milne disclose or suggest a 
system comprising an audio sample pre-processor configured to preprocess audio 
samples to provide characterizing data, and one or more effects configured to 
receive characterizing data and use the characterizing data to render a visualization 
that is synchronized with the audio sample. For this reason alone, the claim is 
allowable. 

Claims 14-20 depend from claim 13 and are allowable as depending from 
an allowable base claim. These claims are also allowable for their own recited 
features which, in combination with those recited in claim 13, are neither disclosed 
nor suggested in the references of record, either singly or in combination with one 
another. Given the allowability of these claims, the rejections of claims 14-15 
over the combination with Van Zoest, and of claims 18-20 over Chernok are not 
seen to add anything of significance. 

Claim 23 recites a method of providing a visualization comprising: 

• receiving multiple audio samples; 

• pre-processing the audio samples before they are rendered by a 
media player renderer, the pre-processing providing characterizing 
data for each sample; 

• determining when an audio sample is being rendered by the media 
player renderer; and 
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• responsive to said determining, using the characterizing data that is 
associated with the audio sample that is being rendered to provide a 
visualization. 

In making out the rejection of this claim, the Office argues that Milne 
discloses: 

(1) receiving multiple audio samples and cites to column 16, 
lines 13-39 and column 15, lines 48-60; 

(2) determining when an audio is being rendered as recited 
and cites to column 19, lines 1-11 and column 17, lines 
15-63; 

(3) responsive to determining, using the characterizing data 
that is associated with the audio sample that is being 
rendered to provide a visualization, and cites to column 
17, lines 15-63; and 

(4) preprocessing the audio samples before they are rendered 
by a media player to provide characterizing data, and 
cites to column 16, lines 13-39, and column 15, lines 48- 
60. 

The Office then relies on Jang's teaching of a pre-processor and argues that 
it would be obvious to combine the two teachings to render the subject matter of 
this claim obvious. Applicant respectfully disagrees with the Office's 
interpretation and application of these references. Specifically, Milne does not 
teach, as the Office argues, a method that pre-processes audio samples to provide 
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characterizing data for each sample, and then uses the characterizing data to 
provide a visualization. Accordingly, for at least this reason, the Office has failed 
to establish a prima facie case of obviousness and this claim is allowable. 

Claims 24-28 depend from claim 23 and are allowable as depending from 
an allowable base claim. These claims are also allowable for their own recited 
features which, in combination with those recited in claim 23, are neither disclosed 
nor suggested in the references of record, either singly or in combination with one 
another. Given the allowability of these claims, the rejection of claims 27-28 over 
the combination with Van Zoest is not seen to add anything of significance. 

Claim 31 recites a method of providing a visualization comprising: 

• calling an audio sample pre-processor for characterizing data that is 
associated with an audio sample that is currently being rendered by a 
media player renderer; 

• calling the media player renderer for a time associated with a 
currently-rendered audio sample; 

• using the time to select a data structure containing characterizing 
data associated with the currently-rendered audio sample; and 

• providing the characterizing data to a component for rendering a 
visualization. 

In making out the rejection of this claim, the Office argues that Milne 
discloses the various acts recited in this claim and cites to various sections of 
Milne in support therefore. The Office then relies on Jang in much the same way 
it relied on Jang to reject the above-discussed claims. Applicant respectfully 
disagrees with the Office's interpretation and application of Milne. Specifically, 
Milne neither discloses nor suggests the subject matter of this claim. For example, 
this claim recites "calling an audio sample pre-processor for characterizing data 
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that is associated with an audio sample" and "providing the characterizing data to 
a component for rendering a visualization." 

As noted above, neither Milne nor Jang disclose or suggest the use of 
characterizing data as that term is contemplated in the Specification and recited in 
this claim. Accordingly, the Office has failed to establish a prima facie case of 
obviousness. Accordingly, for at least this reason, this claim is allowable. 

Claims 32-33 depend from claim 3 1 and are allowable as depending from 
an allowable base claim. These claims are also allowable for their own recited 
features which, in combination with those recited in claim 3 1 , are neither disclosed 
nor suggested in the references of record, either singly or in combination with one 
another. Given the allowability of these claims, the rejection of claims 32-33 over 
the combination with Van Zoest is not seen to add anything of significance. 



The Claimed Subject Matter Rejected Over Van Zoest 

Claims 21-22 stand rejected under 35 U.S.C. § 102(e) as being anticipated 

by Van Zoest. Claim 21 recites a system for processing audio samples 

comprising: 

• a timestamp module for assigning timestamps to audio samples that 
are to be rendered by a media player renderer; 

• a spectrum analyzer for processing the audio samples to provide 
frequency data associated with the audio samples; 

• multiple data structures each of which being associated with an 
audio sample, the data structures each containing timestamp data and 
frequency data for its associated audio sample; and 

• the system being configured to use the timestamp data to ascertain a 
data structure associated with an audio sample that is currently being 
rendered by the media player renderer and provide the frequency 
data associated with that audio sample so that the frequency data can 
be used to render a visualization associated with that audio sample. 
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In making out the rejection of this claim, the Office argues that Van Zoest 
discloses: 

(1) a timestamp module as recited, citing to column 4, lines 57- 
65; 

(2) a spectrum analyzer as recited, citing to column 16, lines 9- 
43; 

(3) multiple data structures as recited, citing to column 9, lines 
24-65, column 4, lines 57-65, and column 16, lines 9-14; and 

(4) a system configured to use the timestamp data to ascertain a 
data structure associated with an audio sample and provide 
the associated frequency data so that the frequency data can 
be used to render a visualization associated with that audio 
sample, citing to column 18, lines 55 through column 19, line 
12. 

Applicant respectfully disagrees with the Office's interpretation of this 
reference and therefore traverses the rejection. 

Van Zoest discloses systems and methods for providing access to electronic 
works over a network. In Van Zoest 5 s system, a user can request access to a 
particular work, such as songs, movies and albums. Before such a request is 
granted, the Van Zoest's system verifies that the user is authorized to access to the 
work. In one of Van Zoest's embodiments, the user verifies that they are 
authorized to access an electronic copy of the work by demonstrating that they 
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possess a physical copy of the work. If a user's authorization is verified, the user 
may then be provided with access to an electronic copy of the work by such 
techniques as downloading or streaming. 

With respect to the verification processing, Van Zoest describes one 
approach, which is cited by the Office as anticipating the spectrum analyzer 
recited in this claim, see, e.g. column 16, lines 9-43, which is reproduced in its 
entirety below, along with preceding text to provide context: 

In a preferred embodiment, such a comparison may make sure it is 
comparing like data with many well-known techniques, such as correlation 
and/or phase shifting. Specifically, the Verification Server 141 requests a 
portion of data from a client machine. The client machine collects this data 
from a physical work and sends this data to the Distributor Location 100. 
The Verification Server 141 receives the corresponding data collected from 
the physical work and determines whether this data can match up with the 
data of one or more stored electronic works. To reduce possible problems 
caused by mechanical error, the comparison may not match the data exactly 
as it is received. Instead, the Verification Server 141 may first identify 
whether any portion of the sample received from the first work potentially 
matches any portion of a stored electronic work or sample of a stored 
electronic work. Once it identifies the corresponding portions of data for 
maximum correlation, the Verification Server 141 may use correlation 
and/or phase shifting techniques to manipulate the data so that the 
comparison is not thrown off by mechanical inaccuracies, such as the 
inaccuracy of the user's personal computer reading a CD. 

For example, as shown in FIG. 5, the customer sample may need to 
be shifted forward or backward to perform a more accurate verification. 
The results of shifting one of the samples across time is shown in FIG. 6. 
Once the Verification Server 141 identifies the corresponding portions of 
the samples with maximum correlation, Verification Server 141 can size 
down the two waveforms until they contain the same portion of the track, as 
shown in FIGS. 6 and 7. 

Once the most closely corresponding portions of the samples or 
works are identified, the verification process compares these possibly 
corresponding portions of the samples. In a preferred embodiment, the 
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verification process runs a Fast Fourier Transform ("FFT") algorithm on 
each WAV samples to generate their respective power spectrums. The 
computer can then compare these two spectrums. 

The Verification Server 141 performs the FFT. The FFT provide a 
frequency analysis of the data. The Verification Server 141 compares either 
or both channels of a stereo audio file and averages the results. When the 
data on the physical work matches the data on the electronic work perfectly, 
the difference between the audio files in the power spectrum can be 0.000. 
Conversely, if a CD is badly scratched at this specific location on the disk 
or the physical work is different from the electronic work, the two data may 
not compare too well. However, in a preferred embodiment, a scratch 
should only affect one or two samples of the 20 samples so that the system 
could still recognize the match. Additionally, as an alternative to FFT, the 
Verification Server 141 can perform direct waveform comparisons of the 
sampled and stored data. 

If the received information identically or substantially matches only 
one stored CD, then the Verification Server 141 automatically stores a 
pointer to the formatted version of this title in the client's personal account. 
Otherwise, the Verification Server 141 asks for additional data (e.g., send 
the seven sectors of data found on track four at the tenth second of playing 
time). The system repeats this process until it has received an acceptable 
amount of data to verify or reject the CD in the user's machine or until it 
completes a predetermined set of requests. The number of requests for 
information by the Verification Server 141 can vary as necessary, as can the 
number of channels of data and sampling rate. 



Thus, what is described in this excerpt is a verification process that seeks to 
verify that a user is authorized to receive a particular work. It does this by 
comparing aspects of a user's physical work, with aspects of an electronic work 
desired for receipt by a user. 

The Office then argues that Van Zoest anticipates the subject matter of item 
(4) above and cites to column 18, line 55 through column 19, line 12 in support 
therefore. The entirety of this excerpt is provided below for the convenience of 
the Office: 
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Once a work is loaded or accessible to a user, the user can access 
those works through a variety of graphical user interfaces and organization 
models, such as a juke box, alphabetically, by artist, by type of work, by 
category of work, by verification date, etc.. In a preferred embodiment, the 
works are organized as shown in FIGS. 8-19. 

The user may be provided with access to the electronic works 
identified in their account by downloading, streaming, email enclosure, or 
many other well-known techniques. Such access may be provided via the 
user's personal computer, cell phone, personal digital assistant, pager, car 
stereo, television or any media player. Before providing such access, the 
Content Delivery System 152 may attempt to identify what media player 
the user is accessing the works with. These characteristics may be 
automatically provided to the Distributor Location 100 as a header or some 
other information field included with the user data. The Distributor 
Location 100 could also request the information automatically from the 
user's device or the user. If the information is requested from the user, the 
user could identify their device by manually entering "cell phone" into a 
blank field or selecting their device from a list of available devices. In an 
alternative embodiment, it may also be able to identify the user's device to 
accommodate for geographically specific server farms. 



The subject claim language that the Office argues is anticipated by this 
excerpt is as follows: 

the system being configured to use the timestamp data to ascertain a 
data structure associated with an audio sample that is currently being 
rendered by the media player renderer and provide the frequency data 
associated with that audio sample so that the frequency data can be used to 
render a visualization associated with that audio sample. 



Nowhere does Van Zoest disclose or suggest a system that provides 
frequency data associated with an audio sample so that the frequency data can be 
used to render a visualization associated with that audio sample. Perhaps this is 
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because of Van Zoesfs divergent use of its frequency data. Specifically, Van 
Zoest uses its frequency data simply to effect a comparison of two particular 
works so that a user can be verified to receive a copy of the work. The presently- 
recited subject matter, on the other hand, incorporates the frequency data in the 
process of rendering a visualization that is associated with a particular audio 
sample. 

Accordingly, for at least this reason, this claim is allowable. 

Claim 22 depends from claim 21 and is allowable as depending from an 
allowable base claim. This claim is also allowable for its own recited features 
which, in combination with those recited in claim 21, are neither disclosed nor 
suggested in the references of record, either singly or in combination with one 
another. 

Claim 29 has been amended and recites a method of providing a 
visualization comprising [added language appears in bold italics]: 

• receiving multiple audio samples; 

• pre-processing the audio samples before they are rendered by a 
media player renderer, the pre-processing comprising at least (1) 
using a Fast Fourier Transform to provide frequency data associated 
with the samples, and (2) associating a timestamp with each sample; 

• maintaining frequency data and a timestamp for each sample in a 
data structure; 

• determining when an audio sample is being rendered by a media 
player renderer by: 

• ascertaining a time associated with a currently-rendered sample; and 

• selecting a data structure having a timestamp that is nearest the time; 
and 

• providing frequency data associated with the selected data structure 
to a component configured to use the frequency data to render the 
visualization. 
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In making out the rejection of this claim, the Office argues that Van Zoest 
discloses the recited acts of "receiving", "maintaining", and "pre-processing". 
The Office then argues that Jang teaches the act of "pre-processing" and further, 
that Chernock teaches the acts of "determining", "selecting", and "providing". 
The Office then argues that it would be obvious to combine these teachings to 
render the subject matter of this claim obvious. Applicant respectfully disagrees 
with the Office's interpretation and application of these references. 

Applicant has made a clarifying amendment to this claim to correct an 
antecedent basis issue. Specifically, the term "characterizing" in the last clause of 
the claim has been replaced with the term "frequency" to provide a consonant 
meaning within the claim. Additionally, the use of the frequency data has been 
clarified to recite that the frequency data is used to render the visualization. 

Applicant submits that none of the references either singly or in 
combination with one another teach or suggest the subject matter of this claim. 
Accordingly, for at least this reason, this claim is allowable. 

Claim 30 depends from claim 29 and is allowable as depending from an 
allowable base claim. This claim is also allowable for its own recited features 
which, in combination with those recited in claim 29, are neither disclosed nor 
suggested in the references of record, either singly or in combination with one 
another. 

Claim 34 has been amended and recites one or more computer-readable 
media having computer-readable instructions thereon which, when executed by a 
computer, cause the computer to [added language appears in bold italics]: 
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• pre-process audio samples using a Fast Fourier Transform to provide 
frequency data, the audio samples being pre-processed before they 
are rendered by a media player renderer; 

• query for frequency data that is associated with an audio sample that 
is currently being rendered by the media player renderer; 

• query for a time associated with the currently-rendered audio 
sample; 

• use the time to select a data structure containing frequency data 
associated with the currently-rendered audio sample; and 

• provide the frequency data to a component that uses the frequency 
data for rendering a visualization. 



In making out the rejection of this claim. The Office argues that Van Zoest 
discloses, inter alia, providing frequency data to a component for rendering a 
visualization. Applicant respectfully disagrees. Van Zoest discloses simply using 
frequency data in a verification process that is tangential, at best, to any rendering 
that takes place in its system. Nonetheless, Applicant has clarified the present 
claim to recite that the instructions cause a computer to "provide the frequency 
data to a component that uses the frequency data for rendering a visualization." 
None of the references disclose or suggest this subject matter either singly or in 
combination with one another. The Office's reliance on Jang is misplaced, 
particularly in view of the clarification made by Applicant. 

Accordingly, for at least this reason, this claim is allowable. 



The Claimed Subject Matter Rejected Over Prasad 
Claim 35 has been amended and recites a method of providing a 
visualization comprising [added language appears in bold italics]: 



defining a frame rate at which visualization frames of a visualization 
are to be rendered, the visualization frames being rendered from 
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characterizing data that is computed from audio samples and 
which is used to create the visualization; 

• setting a threshold associated with an amount of time that is to be 
spent rendering a visualization frame; 

• monitoring the time associated with rendering individual 
visualization frames; 

• determining whether a visualization frame rendering time exceeds 
the threshold; and 

• providing an effective frame rate for rendering visualization frames 
that is longer than the defined frame rate if the determined 
visualization frame rendering time exceeds the threshold. 



In making out the rejection of this claim, the Office argues that the claim is 
anticipated by Prasad, citing to column 6, lines 1-60, column 7, lines 29-59, and 
column 7, lines 1-28 in support therefore. Applicant respectfully disagrees and 
traverses the Office's rejection. 

Prasad discloses methods and apparatus for synchronizing audio and video 
streams in a video conferencing system. During a video conferencing session, 
audio and video streams are transmitted from one processing system to a remote 
processing system, where they are recorded. Because the video stream has a 
variable frame rate during transmission, extra frames are inserted into the recorded 
video stream in order to maintain a constant, predetermined frame rate. During 
playback, synchronization information from the audio stream is provided by an 
audio playback process to a video playback process in order to synchronize the 
start of playing the audio and video streams, as well as to repeatedly synchronize 
the audio and video streams during playback. Thus, the context in which Prasad's 
methods and systems operate is entirely different from the context associated with 
the subject matter of claim 35. 
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In an attempt to clarify the context in which the subject matter of claim 35 
is employed, the claim has been amended. Specifically, the claim has been 
amended to clarify that the act of "defining" is performed to define a frame rate at 
which visualization frames of a visualization are to be rendered. A further 
clarification is provided in that the visualization frames are rendered from 
characterizing data that is computed from audio samples and which is used to 
create the visualization. 

Prasad neither discloses nor suggests any such method. Accordingly, for at 
least this reason, this claim is allowable. 

Claims 36 and 37 depend from claim 35 and are allowable as depending 
from an allowable base claim. These claims are also allowable for their own 
recited features which, in combination with those recited in claim 35, are neither 
disclosed nor suggested in the references of record, either singly or in combination 
with one another. 

Claim 38 has been amended and recites one or more computer-readable 
media having computer-readable instructions thereon which, when executed by a 
computer, cause the computer to [added language appears in bold italics]: 

• set a threshold associated with an amount of time that is to be spent 
rendering a visualization frame for a given frame rate, said 
visualization frame being associated with a visualization that is 
rendered using characterizing data computed from audio samples, 
which characterizing data is used to create the visualization'^ 

• monitor the time associated with rendering individual visualization 
frames; 

• determine whether a visualization frame rendering time exceeds the 
threshold; and 

• provide an effective frame rate for rendering the visualization that is 
longer than the defined frame rate if the determined visualization 
frame rendering time exceeds the threshold. 
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In making out the rejection of this claim, the Office argues that the claim is 
anticipated by Prasad, citing to column 6, lines 1-60, column 7, lines 1-28, and 
column 5, line 48 through column 6, line 60 in support therefore. Applicant 
respectfully disagrees and traverses the Office's rejection. 

As noted above, Prasad discloses methods and apparatus for synchronizing 
audio and video streams in a video conferencing system. Thus, the context in 
which Prasad's methods and systems are employed is quite different from the 
context in which the subject matter of claim 38 is employed. In an attempt to 
clarify the context of the presently-claimed subject matter, this claim has been 
amended to recite that the visualization frame is associated with a visualization 
that is rendered using characterizing data computed from audio samples, and 
that the characterizing data is used to create the visualization. 

As Prasad neither discloses nor suggests any such subject matter, this claim 
is allowable. 
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Conclusion 

All of the claims are in condition for allowance. Applicant respectfully 
requests a Notice of Allowability be issued forthwith. If the Office's next 
anticipated action is to be anything other than issuance of a Notice of Allowability, 
Applicant respectfully requests a telephone call for the purpose of scheduling an 
interview. 



Respectfully Submitted, 




By: 




Reg. No. 38,605 
(509) 324-9256 
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